Machine Learning Project Report
نویسندگان
چکیده
The blog data we have for the project has already been tokenized, more specifically, the data consist of two parts, training vectors and dictionary. Each training sample is represented as a vector, with approximately 1000 components and each component as a word id mapping to the word in the dictionary. However, the dictionary we have is quite crude. Many words are actually closely related, with rather similar meanings and could be seen as the same word. Also there are some random signs and characters with no specific meanings, such as: u
منابع مشابه
TBM Tunneling Construction Time with Respect to Learning Phase Period and Normal Phase Period
In every tunnel boring machine (TBM) tunneling project, there is an initial low production phase so-called the Learning Phase Period (LPP), in which low utilization is experienced and the operational parameters are adjusted to match the working conditions. LPP can be crucial in scheduling and evaluating the final project time and cost, especially for short tunnels for which it may constitute a ...
متن کاملAssisting Software Projects with Bug Report Assignment Recommender Creation
Software development projects receive many change requests each day and each report must be examined to decide how the request will be handled by the project. One decision that is frequently made is to which software developer to assign the change request. Efforts have been made toward semi-automating this decision, with the most promising approaches using machine learning algorithms. However, ...
متن کاملBridging the semantic gap for software effort estimation by hierarchical feature selection techniques
Software project management is one of the significant activates in the software development process. Software Development Effort Estimation (SDEE) is a challenging task in the software project management. SDEE is an old activity in computer industry from 1940s and has been reviewed several times. A SDEE model is appropriate if it provides the accuracy and confidence simultaneously before softwa...
متن کاملCS 229 = = Final Project Report SPEECH & NOISE SEPARATION
In this course project I investigated machine learning approaches on separating speech signals from background noise. Keywords—MFCC, SVM, noise separation, source separation, spectrogram
متن کاملمروری بر روشهای تخمین هزینه نرمافزار مبتنی بر یادگیری ماشین
Software project management software is the most important activity in software development, because it contains the whole software development process, from beginning to end. Software cost estimation is a challenge task in the software project management. It is an old activity in computer industry from 1940s and has been developed many times. Effort, only covers part of the cost of a software ...
متن کاملSupporting Integrated Tourism Services with Semantic Technologies and Machine Learning
In this paper we report our ongoing work on the application of semantic technologies and machine learning to Integrated Tourism in the Apulia Region, Italy, within the Puglia@Service project.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009